Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

Authors

  • Nicolas Chapados
  • Yoshua Bengio
Abstract

We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an application in financial portfolio management where we can train a controller to directly optimize a Sharpe Ratio (or other risk-averse non-additive) utility function. We illustrate the approach by demonstrating experimental results using a kernel-based controller architecture that would not normally be considered in traditional reinforcement learning or approximate dynamic programming. We further show that using a non-additive criterion (incremental Sharpe Ratio) yields a noisy K-best-paths extraction problem, which can give substantially improved performance.
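To make the K-best-paths idea concrete, here is a minimal sketch, not the authors' implementation. It assumes a time-layered lattice whose nodes are discrete portfolio positions and a purely additive per-step reward (position times return, minus a hypothetical proportional transaction cost). The names `k_best_paths`, `returns`, `positions` and `cost` are illustrative assumptions. With an additive reward, keeping the K best partial paths at each node is exact; with a non-additive criterion such as an incremental Sharpe Ratio, the same pruning only approximates the true ranking, which is one way to read the "noisy" extraction the abstract mentions.

```python
import heapq


def k_best_paths(returns, positions, k, cost=0.0, start=0):
    """Return the k highest-reward position sequences as (total, path) pairs.

    Assumption (not from the paper): the reward of moving from position p
    to position q during a period with asset return r is
    q * r - cost * |q - p|, and rewards add up along a path.
    """
    # beams[p] holds up to k (total_reward, path) pairs ending in position p.
    beams = {start: [(0.0, (start,))]}
    for r in returns:
        new_beams = {}
        for q in positions:
            candidates = []
            for p, entries in beams.items():
                step = q * r - cost * abs(q - p)
                for total, path in entries:
                    candidates.append((total + step, path + (q,)))
            # Keep only the k best partial paths ending in q.
            new_beams[q] = heapq.nlargest(k, candidates)
        beams = new_beams
    final = [entry for entries in beams.values() for entry in entries]
    return heapq.nlargest(k, final)


if __name__ == "__main__":
    for total, path in k_best_paths([0.01, -0.02, 0.03], [-1, 0, 1], k=2):
        print(round(total, 4), path)
```

The extracted paths could then serve as (state, action) training targets for a supervised controller, which is the role the abstract assigns to the K-best-paths step.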

Similar resources

MULTIPERIOD CREDIBILITIC MEAN SEMI-ABSOLUTE DEVIATION PORTFOLIO SELECTION

In this paper, we discuss a multiperiod portfolio selection problem with fuzzy returns. We present a new credibilitic multiperiod mean semi-absolute deviation portfolio selection with some real factors including transaction costs, borrowing constraints, entropy constraints, threshold constraints and risk control. In the proposed model, we quantify the investment return and risk associated with...

Lexicographic goal programming approach for portfolio optimization

This paper will investigate the optimum portfolio for an investor, taking into account 5 criteria. The mean variance model of portfolio optimization that was introduced by Markowitz includes two objective functions; these two criteria, risk and return do not encompass all of the information about investment; information like annual dividends, S&P star ranking and return in later years which is ...

Primal and dual robust counterparts of uncertain linear programs: an application to portfolio selection

This paper proposes a family of robust counterparts for uncertain linear programs (LP), obtained for a general definition of the uncertainty region. The relationship between uncertainty sets using norm bodies and their corresponding robust counterparts defined by dual norms is presented. Those properties lead us to characterize primal and dual robust counterparts. The researchers show t...

Portfolio Optimization with Position Constraints: an Approximate Dynamic Programming Approach

We analyze dynamic portfolio choice problems using an approximate dynamic programming (ADP) algorithm. We extend the algorithm to the case of constraints on borrowing and implement a duality-based simulation procedure for estimating bounds on the true value function. We demonstrate that the ADP solution exhibits a high degree of accuracy in the considered examples, indicating that this is a pro...

Multiperiod portfolio management with bankruptcy control under a dynamic programming approach

Efficient portfolio management has long attracted financial researchers and has long been sought by investors. In this research, a multiperiod portfolio optimization problem for asset liability management of an investor who intends to control the probability of bankruptcy is investigated. The proposed portfolio consists of a number of risky assets, a risk-free asset and a type of d...

Journal title:
  • JCP

Volume 2  Issue 

Pages  -

Publication date 2007